Search results for "Log likelihood"

Getting rid of the Chi-square and Log-likelihood tests for analysing vocabulary differences between corpora

2018

Log-likelihood and Chi-square tests are probably the most popular statistical tests in corpus linguistics, especially when the research aims to describe lexical variation between corpora. However, because this specific use of the Chi-square test is not valid, it produces far too many significant results. This paper explains the source of the problem (i.e., the non-independence of the observations), why the usual solutions are not acceptable, and which kinds of statistical test should be used instead. A corpus analysis of the lexical differences between American and British English is then reported, in order to demonstrate the problem and to confirm …
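
For readers unfamiliar with the two tests named in the abstract, the following is a minimal sketch of how they are commonly computed for a single word across two corpora, using a 2x2 contingency table (target word vs. all other tokens, corpus A vs. corpus B). It illustrates the conventional approach that the paper criticizes, not the alternative it recommends. The function names and the example counts are hypothetical and do not come from the paper.

```python
import math


def contingency(freq_a, size_a, freq_b, size_b):
    """Build a 2x2 table: rows are corpora, columns are
    (target word, all other tokens)."""
    return [[freq_a, size_a - freq_a], [freq_b, size_b - freq_b]]


def expected(table):
    """Expected counts if the word were equally frequent in both corpora."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    return [[row_totals[i] * col_totals[j] / total for j in range(2)]
            for i in range(2)]


def chi_square(table):
    """Pearson Chi-square statistic: sum of (O - E)^2 / E over all cells."""
    exp = expected(table)
    return sum((table[i][j] - exp[i][j]) ** 2 / exp[i][j]
               for i in range(2) for j in range(2))


def log_likelihood(table):
    """Log-likelihood ratio (G2): 2 * sum of O * ln(O / E).
    Cells with an observed count of zero contribute nothing."""
    exp = expected(table)
    return 2 * sum(table[i][j] * math.log(table[i][j] / exp[i][j])
                   for i in range(2) for j in range(2)
                   if table[i][j] > 0)


# Hypothetical counts: a word occurring 120 times in a 100,000-token
# corpus and 80 times in another 100,000-token corpus.
table = contingency(120, 100_000, 80, 100_000)
print(chi_square(table), log_likelihood(table))
```

Both statistics are usually compared against the Chi-square distribution with one degree of freedom (critical value 3.84 at p < 0.05). Note that both functions treat every token as an independent observation, which is the assumption the abstract identifies as the source of the problem.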

Linguistics and Language, Vocabulary, Computer science, British English, Log likelihood, Language and Linguistics, Test (assessment), Software, Corpus linguistics, Chi-square test, Artificial intelligence, Natural language processing, Statistical hypothesis testing